Overview
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 999 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 180.6 KiB |
| Average record size in memory | 185.1 B |
Variable types
| Numeric | 15 |
|---|---|
| Categorical | 3 |
bathrooms is highly overall correlated with floors and 5 other fields | High correlation |
bedrooms is highly overall correlated with sqft_above and 1 other fields | High correlation |
floors is highly overall correlated with bathrooms and 1 other fields | High correlation |
grade is highly overall correlated with bathrooms and 4 other fields | High correlation |
lat is highly overall correlated with price | High correlation |
long is highly overall correlated with zipcode | High correlation |
price is highly overall correlated with grade and 4 other fields | High correlation |
sqft_above is highly overall correlated with bathrooms and 6 other fields | High correlation |
sqft_living is highly overall correlated with bathrooms and 5 other fields | High correlation |
sqft_living15 is highly overall correlated with bathrooms and 4 other fields | High correlation |
view is highly overall correlated with waterfront | High correlation |
waterfront is highly overall correlated with view | High correlation |
yr_built is highly overall correlated with bathrooms | High correlation |
zipcode is highly overall correlated with long | High correlation |
waterfront is highly imbalanced (93.3%) | Imbalance |
view is highly imbalanced (72.0%) | Imbalance |
sqft_basement has 598 (59.9%) zeros | Zeros |
yr_renovated has 958 (95.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-12-23 04:22:54.166647 |
|---|---|
| Analysis finished | 2025-12-23 04:23:50.604928 |
| Duration | 56.44 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
bedrooms
Real number (ℝ)
High correlation
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3493493 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.85236737 |
|---|---|
| Coefficient of variation (CV) | 0.25448745 |
| Kurtosis | 0.83306099 |
| Mean | 3.3493493 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.35423566 |
| Sum | 3346 |
| Variance | 0.72653014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 491 | |
| 4 | 305 | |
| 2 | 114 | 11.4% |
| 5 | 69 | 6.9% |
| 6 | 11 | 1.1% |
| 1 | 7 | 0.7% |
| 7 | 1 | 0.1% |
| 0 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.1% |
| 1 | 7 | 0.7% |
| 2 | 114 | 11.4% |
| 3 | 491 | |
| 4 | 305 | |
| 5 | 69 | 6.9% |
| 6 | 11 | 1.1% |
| 7 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | 0.1% |
| 6 | 11 | 1.1% |
| 5 | 69 | 6.9% |
| 4 | 305 | |
| 3 | 491 | |
| 2 | 114 | 11.4% |
| 1 | 7 | 0.7% |
| 0 | 1 | 0.1% |
bathrooms
Real number (ℝ)
High correlation
| Distinct | 19 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0457958 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.5 |
| median | 2 |
| Q3 | 2.5 |
| 95-th percentile | 3.25 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.72198328 |
|---|---|
| Coefficient of variation (CV) | 0.35291073 |
| Kurtosis | 0.39664366 |
| Mean | 2.0457958 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.30866374 |
| Sum | 2043.75 |
| Variance | 0.52125986 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.5 | 238 | |
| 1 | 187 | |
| 1.75 | 159 | |
| 2.25 | 104 | |
| 2 | 98 | |
| 1.5 | 61 | 6.1% |
| 2.75 | 51 | 5.1% |
| 3 | 33 | 3.3% |
| 3.5 | 28 | 2.8% |
| 3.25 | 18 | 1.8% |
| Other values (9) | 22 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 1 | 0.1% |
| 0.75 | 6 | 0.6% |
| 1 | 187 | |
| 1.25 | 1 | 0.1% |
| 1.5 | 61 | 6.1% |
| 1.75 | 159 | |
| 2 | 98 | |
| 2.25 | 104 | |
| 2.5 | 238 | |
| 2.75 | 51 | 5.1% |
| Value | Count | Frequency (%) |
| 5 | 2 | 0.2% |
| 4.75 | 1 | 0.1% |
| 4.5 | 2 | 0.2% |
| 4.25 | 4 | 0.4% |
| 4 | 4 | 0.4% |
| 3.75 | 1 | 0.1% |
| 3.5 | 28 | |
| 3.25 | 18 | 1.8% |
| 3 | 33 | |
| 2.75 | 51 |
sqft_living
Real number (ℝ)
High correlation
| Distinct | 321 |
|---|---|
| Distinct (%) | 32.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2051.3974 |
| Minimum | 380 |
|---|---|
| Maximum | 6070 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 380 |
|---|---|
| 5-th percentile | 990 |
| Q1 | 1405 |
| median | 1900 |
| Q3 | 2475 |
| 95-th percentile | 3830 |
| Maximum | 6070 |
| Range | 5690 |
| Interquartile range (IQR) | 1070 |
Descriptive statistics
| Standard deviation | 888.35111 |
|---|---|
| Coefficient of variation (CV) | 0.43304682 |
| Kurtosis | 2.0253918 |
| Mean | 2051.3974 |
| Median Absolute Deviation (MAD) | 530 |
| Skewness | 1.2472064 |
| Sum | 2049346 |
| Variance | 789167.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1250 | 11 | 1.1% |
| 1510 | 11 | 1.1% |
| 1300 | 10 | 1.0% |
| 1490 | 10 | 1.0% |
| 1330 | 10 | 1.0% |
| 2020 | 9 | 0.9% |
| 2160 | 9 | 0.9% |
| 1430 | 9 | 0.9% |
| 1670 | 9 | 0.9% |
| 1400 | 9 | 0.9% |
| Other values (311) | 902 |
| Value | Count | Frequency (%) |
| 380 | 1 | 0.1% |
| 430 | 1 | 0.1% |
| 560 | 1 | 0.1% |
| 630 | 2 | |
| 700 | 1 | 0.1% |
| 720 | 1 | 0.1% |
| 740 | 1 | 0.1% |
| 750 | 2 | |
| 760 | 1 | 0.1% |
| 770 | 3 |
| Value | Count | Frequency (%) |
| 6070 | 1 | 0.1% |
| 6050 | 2 | |
| 5420 | 1 | 0.1% |
| 5403 | 1 | 0.1% |
| 5310 | 1 | 0.1% |
| 5180 | 1 | 0.1% |
| 5050 | 1 | 0.1% |
| 4890 | 1 | 0.1% |
| 4870 | 1 | 0.1% |
| 4860 | 3 |
sqft_lot
Real number (ℝ)
| Distinct | 828 |
|---|---|
| Distinct (%) | 82.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14707.242 |
| Minimum | 649 |
|---|---|
| Maximum | 315374 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 649 |
|---|---|
| 5-th percentile | 2871.5 |
| Q1 | 5419 |
| median | 8040 |
| Q3 | 11508.5 |
| 95-th percentile | 40445.2 |
| Maximum | 315374 |
| Range | 314725 |
| Interquartile range (IQR) | 6089.5 |
Descriptive statistics
| Standard deviation | 28975.077 |
|---|---|
| Coefficient of variation (CV) | 1.9701231 |
| Kurtosis | 41.996043 |
| Mean | 14707.242 |
| Median Absolute Deviation (MAD) | 2920 |
| Skewness | 6.0808509 |
| Sum | 14692535 |
| Variance | 8.395551 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 27 | 2.7% |
| 6000 | 12 | 1.2% |
| 4000 | 10 | 1.0% |
| 9600 | 7 | 0.7% |
| 9000 | 6 | 0.6% |
| 3000 | 5 | 0.5% |
| 7500 | 5 | 0.5% |
| 7200 | 5 | 0.5% |
| 5400 | 5 | 0.5% |
| 8400 | 5 | 0.5% |
| Other values (818) | 912 |
| Value | Count | Frequency (%) |
| 649 | 1 | |
| 1016 | 1 | |
| 1044 | 1 | |
| 1058 | 1 | |
| 1066 | 1 | |
| 1086 | 1 | |
| 1091 | 1 | |
| 1100 | 1 | |
| 1102 | 1 | |
| 1140 | 1 |
| Value | Count | Frequency (%) |
| 315374 | 1 | |
| 262018 | 1 | |
| 230652 | 1 | |
| 221284 | 1 | |
| 219978 | 1 | |
| 218252 | 1 | |
| 217800 | 1 | |
| 217014 | 1 | |
| 213444 | 1 | |
| 209959 | 1 |
floors
Real number (ℝ)
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4469469 |
| Minimum | 1 |
|---|---|
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 3.5 |
| Range | 2.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.51742017 |
|---|---|
| Coefficient of variation (CV) | 0.35759443 |
| Kurtosis | -0.43785566 |
| Mean | 1.4469469 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.68618829 |
| Sum | 1445.5 |
| Variance | 0.26772364 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 528 | |
| 2 | 354 | |
| 1.5 | 93 | 9.3% |
| 3 | 18 | 1.8% |
| 2.5 | 5 | 0.5% |
| 3.5 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 528 | |
| 1.5 | 93 | 9.3% |
| 2 | 354 | |
| 2.5 | 5 | 0.5% |
| 3 | 18 | 1.8% |
| 3.5 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 3.5 | 1 | 0.1% |
| 3 | 18 | 1.8% |
| 2.5 | 5 | 0.5% |
| 2 | 354 | |
| 1.5 | 93 | 9.3% |
| 1 | 528 |
waterfront
Categorical
High correlation Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
| 0.0 | |
|---|---|
| 1.0 | 8 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 991 | |
| 1.0 | 8 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 991 | |
| 1.0 | 8 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1990 | |
| . | 999 | |
| 1 | 8 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1990 | |
| . | 999 | |
| 1 | 8 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1990 | |
| . | 999 | |
| 1 | 8 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1990 | |
| . | 999 | |
| 1 | 8 | 0.3% |
view
Categorical
High correlation Imbalance
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
| 0.0 | |
|---|---|
| 2.0 | 46 |
| 3.0 | 26 |
| 1.0 | 15 |
| 4.0 | 13 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 899 | |
| 2.0 | 46 | 4.6% |
| 3.0 | 26 | 2.6% |
| 1.0 | 15 | 1.5% |
| 4.0 | 13 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 899 | |
| 2.0 | 46 | 4.6% |
| 3.0 | 26 | 2.6% |
| 1.0 | 15 | 1.5% |
| 4.0 | 13 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1898 | |
| . | 999 | |
| 2 | 46 | 1.5% |
| 3 | 26 | 0.9% |
| 1 | 15 | 0.5% |
| 4 | 13 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1898 | |
| . | 999 | |
| 2 | 46 | 1.5% |
| 3 | 26 | 0.9% |
| 1 | 15 | 0.5% |
| 4 | 13 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1898 | |
| . | 999 | |
| 2 | 46 | 1.5% |
| 3 | 26 | 0.9% |
| 1 | 15 | 0.5% |
| 4 | 13 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1898 | |
| . | 999 | |
| 2 | 46 | 1.5% |
| 3 | 26 | 0.9% |
| 1 | 15 | 0.5% |
| 4 | 13 | 0.4% |
condition
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 47.9 KiB |
| 3.0 | |
|---|---|
| 4.0 | |
| 5.0 | |
| 2.0 | 6 |
| 1.0 | 3 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 3.0 |
| 4th row | 5.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 612 | |
| 4.0 | 280 | |
| 5.0 | 98 | 9.8% |
| 2.0 | 6 | 0.6% |
| 1.0 | 3 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3.0 | 612 | |
| 4.0 | 280 | |
| 5.0 | 98 | 9.8% |
| 2.0 | 6 | 0.6% |
| 1.0 | 3 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 999 | |
| 0 | 999 | |
| 3 | 612 | |
| 4 | 280 | 9.3% |
| 5 | 98 | 3.3% |
| 2 | 6 | 0.2% |
| 1 | 3 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 999 | |
| 0 | 999 | |
| 3 | 612 | |
| 4 | 280 | 9.3% |
| 5 | 98 | 3.3% |
| 2 | 6 | 0.2% |
| 1 | 3 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 999 | |
| 0 | 999 | |
| 3 | 612 | |
| 4 | 280 | 9.3% |
| 5 | 98 | 3.3% |
| 2 | 6 | 0.2% |
| 1 | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2997 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 999 | |
| 0 | 999 | |
| 3 | 612 | |
| 4 | 280 | 9.3% |
| 5 | 98 | 3.3% |
| 2 | 6 | 0.2% |
| 1 | 3 | 0.1% |
grade
Real number (ℝ)
High correlation
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.6056056 |
| Minimum | 4 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 7 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 12 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1607339 |
|---|---|
| Coefficient of variation (CV) | 0.15261558 |
| Kurtosis | 1.3395351 |
| Mean | 7.6056056 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.83089173 |
| Sum | 7598 |
| Variance | 1.3473032 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 445 | |
| 8 | 266 | |
| 9 | 112 | 11.2% |
| 6 | 91 | 9.1% |
| 10 | 47 | 4.7% |
| 11 | 18 | 1.8% |
| 5 | 13 | 1.3% |
| 12 | 5 | 0.5% |
| 4 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 4 | 2 | 0.2% |
| 5 | 13 | 1.3% |
| 6 | 91 | 9.1% |
| 7 | 445 | |
| 8 | 266 | |
| 9 | 112 | 11.2% |
| 10 | 47 | 4.7% |
| 11 | 18 | 1.8% |
| 12 | 5 | 0.5% |
| Value | Count | Frequency (%) |
| 12 | 5 | 0.5% |
| 11 | 18 | 1.8% |
| 10 | 47 | 4.7% |
| 9 | 112 | 11.2% |
| 8 | 266 | |
| 7 | 445 | |
| 6 | 91 | 9.1% |
| 5 | 13 | 1.3% |
| 4 | 2 | 0.2% |
sqft_above
Real number (ℝ)
High correlation
| Distinct | 291 |
|---|---|
| Distinct (%) | 29.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1750.2332 |
| Minimum | 380 |
|---|---|
| Maximum | 6070 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 380 |
|---|---|
| 5-th percentile | 869 |
| Q1 | 1190 |
| median | 1540 |
| Q3 | 2135 |
| 95-th percentile | 3300 |
| Maximum | 6070 |
| Range | 5690 |
| Interquartile range (IQR) | 945 |
Descriptive statistics
| Standard deviation | 790.46691 |
|---|---|
| Coefficient of variation (CV) | 0.45163518 |
| Kurtosis | 3.1296995 |
| Mean | 1750.2332 |
| Median Absolute Deviation (MAD) | 430 |
| Skewness | 1.4790359 |
| Sum | 1748483 |
| Variance | 624837.93 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1250 | 14 | 1.4% |
| 1010 | 14 | 1.4% |
| 1300 | 14 | 1.4% |
| 1330 | 12 | 1.2% |
| 1610 | 11 | 1.1% |
| 1320 | 11 | 1.1% |
| 1100 | 11 | 1.1% |
| 1000 | 10 | 1.0% |
| 1070 | 10 | 1.0% |
| 1130 | 10 | 1.0% |
| Other values (281) | 882 |
| Value | Count | Frequency (%) |
| 380 | 1 | 0.1% |
| 430 | 1 | 0.1% |
| 560 | 1 | 0.1% |
| 580 | 1 | 0.1% |
| 630 | 2 | |
| 670 | 1 | 0.1% |
| 700 | 4 | |
| 720 | 2 | |
| 740 | 1 | 0.1% |
| 750 | 2 |
| Value | Count | Frequency (%) |
| 6070 | 1 | |
| 6050 | 1 | |
| 5403 | 1 | |
| 5310 | 1 | |
| 4860 | 1 | |
| 4750 | 1 | |
| 4740 | 1 | |
| 4670 | 1 | |
| 4570 | 1 | |
| 4410 | 1 |
sqft_basement
Real number (ℝ)
Zeros
| Distinct | 140 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 301.16416 |
| Minimum | 0 |
|---|---|
| Maximum | 2060 |
| Zeros | 598 |
| Zeros (%) | 59.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 580 |
| 95-th percentile | 1223 |
| Maximum | 2060 |
| Range | 2060 |
| Interquartile range (IQR) | 580 |
Descriptive statistics
| Standard deviation | 451.0234 |
|---|---|
| Coefficient of variation (CV) | 1.4975998 |
| Kurtosis | 1.4506011 |
| Mean | 301.16416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4714516 |
| Sum | 300863 |
| Variance | 203422.11 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 598 | |
| 600 | 14 | 1.4% |
| 500 | 13 | 1.3% |
| 700 | 13 | 1.3% |
| 400 | 12 | 1.2% |
| 800 | 10 | 1.0% |
| 300 | 8 | 0.8% |
| 1040 | 7 | 0.7% |
| 530 | 7 | 0.7% |
| 1010 | 7 | 0.7% |
| Other values (130) | 310 |
| Value | Count | Frequency (%) |
| 0 | 598 | |
| 50 | 2 | 0.2% |
| 60 | 1 | 0.1% |
| 120 | 3 | 0.3% |
| 130 | 2 | 0.2% |
| 140 | 3 | 0.3% |
| 150 | 1 | 0.1% |
| 160 | 3 | 0.3% |
| 180 | 3 | 0.3% |
| 190 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 2060 | 1 | |
| 2000 | 1 | |
| 1950 | 2 | |
| 1900 | 1 | |
| 1830 | 1 | |
| 1820 | 1 | |
| 1810 | 1 | |
| 1800 | 1 | |
| 1780 | 1 | |
| 1760 | 2 |
yr_built
Real number (ℝ)
High correlation
| Distinct | 114 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1969.03 |
| Minimum | 1900 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1915 |
| Q1 | 1952 |
| median | 1974 |
| Q3 | 1992 |
| 95-th percentile | 2006 |
| Maximum | 2015 |
| Range | 115 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 28.198607 |
|---|---|
| Coefficient of variation (CV) | 0.014321065 |
| Kurtosis | -0.53885541 |
| Mean | 1969.03 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.54476677 |
| Sum | 1967061 |
| Variance | 795.16142 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1954 | 25 | 2.5% |
| 2005 | 25 | 2.5% |
| 1977 | 20 | 2.0% |
| 1979 | 20 | 2.0% |
| 1994 | 19 | 1.9% |
| 1968 | 19 | 1.9% |
| 1978 | 19 | 1.9% |
| 2003 | 18 | 1.8% |
| 2004 | 18 | 1.8% |
| 1987 | 17 | 1.7% |
| Other values (104) | 799 |
| Value | Count | Frequency (%) |
| 1900 | 6 | |
| 1901 | 2 | 0.2% |
| 1902 | 1 | 0.1% |
| 1903 | 1 | 0.1% |
| 1904 | 3 | |
| 1905 | 5 | |
| 1907 | 2 | 0.2% |
| 1908 | 5 | |
| 1909 | 4 | |
| 1910 | 5 |
| Value | Count | Frequency (%) |
| 2015 | 1 | 0.1% |
| 2014 | 10 | |
| 2013 | 3 | 0.3% |
| 2012 | 3 | 0.3% |
| 2011 | 1 | 0.1% |
| 2010 | 6 | |
| 2009 | 4 | 0.4% |
| 2008 | 8 | |
| 2007 | 9 | |
| 2006 | 13 |
yr_renovated
Real number (ℝ)
Zeros
| Distinct | 25 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.830831 |
| Minimum | 0 |
|---|---|
| Maximum | 2014 |
| Zeros | 958 |
| Zeros (%) | 95.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2014 |
| Range | 2014 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 395.76792 |
|---|---|
| Coefficient of variation (CV) | 4.8364157 |
| Kurtosis | 19.518878 |
| Mean | 81.830831 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.6344404 |
| Sum | 81749 |
| Variance | 156632.24 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 958 | |
| 1991 | 3 | 0.3% |
| 2002 | 3 | 0.3% |
| 2013 | 3 | 0.3% |
| 2005 | 3 | 0.3% |
| 2003 | 3 | 0.3% |
| 2014 | 3 | 0.3% |
| 1978 | 2 | 0.2% |
| 1999 | 2 | 0.2% |
| 1990 | 2 | 0.2% |
| Other values (15) | 17 | 1.7% |
| Value | Count | Frequency (%) |
| 0 | 958 | |
| 1945 | 1 | 0.1% |
| 1954 | 1 | 0.1% |
| 1957 | 1 | 0.1% |
| 1974 | 1 | 0.1% |
| 1977 | 1 | 0.1% |
| 1978 | 2 | 0.2% |
| 1981 | 1 | 0.1% |
| 1983 | 1 | 0.1% |
| 1984 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 2014 | 3 | |
| 2013 | 3 | |
| 2011 | 1 | 0.1% |
| 2010 | 1 | 0.1% |
| 2008 | 1 | 0.1% |
| 2005 | 3 | |
| 2003 | 3 | |
| 2002 | 3 | |
| 1999 | 2 | |
| 1995 | 1 | 0.1% |
zipcode
Real number (ℝ)
High correlation
| Distinct | 69 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 98074.441 |
| Minimum | 98001 |
|---|---|
| Maximum | 98199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 98001 |
|---|---|
| 5-th percentile | 98004 |
| Q1 | 98032 |
| median | 98058 |
| Q3 | 98116 |
| 95-th percentile | 98177 |
| Maximum | 98199 |
| Range | 198 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 52.545832 |
|---|---|
| Coefficient of variation (CV) | 0.00053577498 |
| Kurtosis | -0.73273415 |
| Mean | 98074.441 |
| Median Absolute Deviation (MAD) | 44 |
| Skewness | 0.51323728 |
| Sum | 97976367 |
| Variance | 2761.0645 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 98038 | 38 | 3.8% |
| 98006 | 32 | 3.2% |
| 98023 | 30 | 3.0% |
| 98052 | 29 | 2.9% |
| 98058 | 27 | 2.7% |
| 98133 | 27 | 2.7% |
| 98042 | 27 | 2.7% |
| 98103 | 27 | 2.7% |
| 98034 | 26 | 2.6% |
| 98115 | 26 | 2.6% |
| Other values (59) | 710 |
| Value | Count | Frequency (%) |
| 98001 | 16 | |
| 98002 | 8 | 0.8% |
| 98003 | 16 | |
| 98004 | 13 | |
| 98005 | 8 | 0.8% |
| 98006 | 32 | |
| 98007 | 4 | 0.4% |
| 98008 | 9 | 0.9% |
| 98010 | 7 | 0.7% |
| 98011 | 11 | 1.1% |
| Value | Count | Frequency (%) |
| 98199 | 7 | |
| 98198 | 14 | |
| 98188 | 7 | |
| 98178 | 16 | |
| 98177 | 10 | |
| 98168 | 13 | |
| 98166 | 16 | |
| 98155 | 9 | |
| 98148 | 2 | 0.2% |
| 98146 | 11 |
lat
Real number (ℝ)
High correlation
| Distinct | 895 |
|---|---|
| Distinct (%) | 89.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.54972 |
| Minimum | 47.1775 |
|---|---|
| Maximum | 47.7776 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 47.1775 |
|---|---|
| 5-th percentile | 47.31017 |
| Q1 | 47.443 |
| median | 47.5636 |
| Q3 | 47.6734 |
| 95-th percentile | 47.74792 |
| Maximum | 47.7776 |
| Range | 0.6001 |
| Interquartile range (IQR) | 0.2304 |
Descriptive statistics
| Standard deviation | 0.14155835 |
|---|---|
| Coefficient of variation (CV) | 0.0029770595 |
| Kurtosis | -0.87846204 |
| Mean | 47.54972 |
| Median Absolute Deviation (MAD) | 0.1141 |
| Skewness | -0.35819307 |
| Sum | 47502.17 |
| Variance | 0.020038766 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 47.7073 | 4 | 0.4% |
| 47.3608 | 3 | 0.3% |
| 47.6734 | 3 | 0.3% |
| 47.6597 | 3 | 0.3% |
| 47.3663 | 3 | 0.3% |
| 47.7145 | 3 | 0.3% |
| 47.4802 | 3 | 0.3% |
| 47.5123 | 2 | 0.2% |
| 47.3828 | 2 | 0.2% |
| 47.69 | 2 | 0.2% |
| Other values (885) | 971 |
| Value | Count | Frequency (%) |
| 47.1775 | 1 | |
| 47.1803 | 1 | |
| 47.1913 | 1 | |
| 47.1949 | 1 | |
| 47.1951 | 1 | |
| 47.1976 | 1 | |
| 47.2086 | 1 | |
| 47.2105 | 1 | |
| 47.2118 | 1 | |
| 47.2413 | 1 |
| Value | Count | Frequency (%) |
| 47.7776 | 1 | |
| 47.7767 | 1 | |
| 47.7751 | 1 | |
| 47.7738 | 1 | |
| 47.7736 | 2 | |
| 47.7734 | 1 | |
| 47.7731 | 1 | |
| 47.7728 | 1 | |
| 47.7727 | 1 | |
| 47.7721 | 1 |
long
Real number (ℝ)
High correlation
| Distinct | 405 |
|---|---|
| Distinct (%) | 40.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.20741 |
| Minimum | -122.49 |
|---|---|
| Maximum | -121.709 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 999 |
| Negative (%) | 100.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | -122.49 |
|---|---|
| 5-th percentile | -122.382 |
| Q1 | -122.3225 |
| median | -122.218 |
| Q3 | -122.118 |
| 95-th percentile | -121.9738 |
| Maximum | -121.709 |
| Range | 0.781 |
| Interquartile range (IQR) | 0.2045 |
Descriptive statistics
| Standard deviation | 0.13956378 |
|---|---|
| Coefficient of variation (CV) | -0.0011420239 |
| Kurtosis | 0.21344681 |
| Mean | -122.20741 |
| Median Absolute Deviation (MAD) | 0.102 |
| Skewness | 0.73404715 |
| Sum | -122085.2 |
| Variance | 0.019478049 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.188 | 9 | 0.9% |
| -122.191 | 8 | 0.8% |
| -122.198 | 7 | 0.7% |
| -122.371 | 7 | 0.7% |
| -122.372 | 7 | 0.7% |
| -122.125 | 7 | 0.7% |
| -122.319 | 7 | 0.7% |
| -122.381 | 7 | 0.7% |
| -122.353 | 7 | 0.7% |
| -122.286 | 7 | 0.7% |
| Other values (395) | 926 |
| Value | Count | Frequency (%) |
| -122.49 | 1 | 0.1% |
| -122.482 | 1 | 0.1% |
| -122.451 | 1 | 0.1% |
| -122.438 | 2 | |
| -122.411 | 1 | 0.1% |
| -122.409 | 1 | 0.1% |
| -122.405 | 1 | 0.1% |
| -122.402 | 3 | |
| -122.4 | 2 | |
| -122.396 | 3 |
| Value | Count | Frequency (%) |
| -121.709 | 1 | |
| -121.711 | 1 | |
| -121.714 | 1 | |
| -121.755 | 1 | |
| -121.758 | 1 | |
| -121.759 | 1 | |
| -121.771 | 1 | |
| -121.772 | 1 | |
| -121.776 | 1 | |
| -121.779 | 1 |
sqft_living15
Real number (ℝ)
High correlation
| Distinct | 267 |
|---|---|
| Distinct (%) | 26.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1986.8138 |
| Minimum | 830 |
|---|---|
| Maximum | 4760 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 830 |
|---|---|
| 5-th percentile | 1169 |
| Q1 | 1490 |
| median | 1850 |
| Q3 | 2360 |
| 95-th percentile | 3251 |
| Maximum | 4760 |
| Range | 3930 |
| Interquartile range (IQR) | 870 |
Descriptive statistics
| Standard deviation | 670.72347 |
|---|---|
| Coefficient of variation (CV) | 0.33758748 |
| Kurtosis | 1.0944439 |
| Mean | 1986.8138 |
| Median Absolute Deviation (MAD) | 410 |
| Skewness | 1.049913 |
| Sum | 1984827 |
| Variance | 449869.98 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1570 | 12 | 1.2% |
| 1580 | 11 | 1.1% |
| 1320 | 11 | 1.1% |
| 1560 | 11 | 1.1% |
| 1390 | 10 | 1.0% |
| 1460 | 10 | 1.0% |
| 1590 | 10 | 1.0% |
| 1440 | 10 | 1.0% |
| 1610 | 10 | 1.0% |
| 1660 | 10 | 1.0% |
| Other values (257) | 894 |
| Value | Count | Frequency (%) |
| 830 | 1 | 0.1% |
| 880 | 1 | 0.1% |
| 890 | 1 | 0.1% |
| 940 | 1 | 0.1% |
| 950 | 2 | |
| 970 | 1 | 0.1% |
| 980 | 1 | 0.1% |
| 1000 | 2 | |
| 1010 | 3 | |
| 1020 | 4 |
| Value | Count | Frequency (%) |
| 4760 | 1 | |
| 4680 | 1 | |
| 4550 | 1 | |
| 4300 | 1 | |
| 4230 | 1 | |
| 4210 | 1 | |
| 4190 | 1 | |
| 4180 | 1 | |
| 4110 | 1 | |
| 4100 | 1 |
price
Real number (ℝ)
High correlation
| Distinct | 580 |
|---|---|
| Distinct (%) | 58.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52.071452 |
| Minimum | 8 |
|---|---|
| Maximum | 308 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 47.9 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 30.98 |
| median | 43.5 |
| Q3 | 63.44625 |
| 95-th percentile | 110 |
| Maximum | 308 |
| Range | 300 |
| Interquartile range (IQR) | 32.46625 |
Descriptive statistics
| Standard deviation | 33.974907 |
|---|---|
| Coefficient of variation (CV) | 0.65246705 |
| Kurtosis | 13.948624 |
| Mean | 52.071452 |
| Median Absolute Deviation (MAD) | 14.6651 |
| Skewness | 2.9749338 |
| Sum | 52019.38 |
| Variance | 1154.2943 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 13 | 1.3% |
| 31.5 | 10 | 1.0% |
| 26 | 9 | 0.9% |
| 42.5 | 9 | 0.9% |
| 28 | 8 | 0.8% |
| 53 | 8 | 0.8% |
| 32 | 8 | 0.8% |
| 33 | 8 | 0.8% |
| 38.5 | 7 | 0.7% |
| 45 | 7 | 0.7% |
| Other values (570) | 912 |
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 13 | 1 | |
| 14.75 | 1 | |
| 15.3 | 1 | |
| 15.7 | 1 | |
| 16 | 1 | |
| 16.35 | 1 | |
| 16.5 | 1 | |
| 16.66 | 1 | |
| 16.695 | 1 |
| Value | Count | Frequency (%) |
| 308 | 1 | 0.1% |
| 307 | 1 | 0.1% |
| 290 | 1 | 0.1% |
| 240 | 2 | |
| 238 | 1 | 0.1% |
| 225 | 3 | |
| 213 | 1 | 0.1% |
| 205 | 1 | 0.1% |
| 200 | 1 | 0.1% |
| 195 | 1 | 0.1% |
Interactions
Correlations
| bathrooms | bedrooms | condition | floors | grade | lat | long | price | sqft_above | sqft_basement | sqft_living | sqft_living15 | sqft_lot | view | waterfront | yr_built | yr_renovated | zipcode | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| bathrooms | 1.000 | 0.482 | 0.109 | 0.519 | 0.655 | 0.025 | 0.304 | 0.470 | 0.691 | 0.177 | 0.732 | 0.612 | 0.115 | 0.135 | 0.183 | 0.570 | 0.026 | -0.236 |
| bedrooms | 0.482 | 1.000 | 0.071 | 0.237 | 0.372 | -0.007 | 0.171 | 0.329 | 0.527 | 0.185 | 0.612 | 0.433 | 0.202 | 0.154 | 0.099 | 0.151 | 0.014 | -0.174 |
| condition | 0.109 | 0.071 | 1.000 | 0.162 | 0.163 | 0.067 | 0.095 | 0.000 | 0.065 | 0.078 | 0.008 | 0.049 | 0.079 | 0.000 | 0.000 | 0.242 | 0.067 | 0.096 |
| floors | 0.519 | 0.237 | 0.162 | 1.000 | 0.462 | 0.065 | 0.175 | 0.328 | 0.618 | -0.307 | 0.403 | 0.323 | -0.168 | 0.054 | 0.000 | 0.464 | 0.046 | -0.027 |
| grade | 0.655 | 0.372 | 0.163 | 0.462 | 1.000 | 0.128 | 0.232 | 0.626 | 0.695 | 0.133 | 0.721 | 0.682 | 0.171 | 0.147 | 0.180 | 0.484 | -0.004 | -0.185 |
| lat | 0.025 | -0.007 | 0.067 | 0.065 | 0.128 | 1.000 | -0.115 | 0.551 | 0.010 | 0.160 | 0.101 | 0.083 | -0.134 | 0.056 | 0.078 | -0.142 | 0.057 | 0.259 |
| long | 0.304 | 0.171 | 0.095 | 0.175 | 0.232 | -0.115 | 1.000 | 0.049 | 0.388 | -0.205 | 0.273 | 0.375 | 0.393 | 0.069 | 0.466 | 0.485 | -0.095 | -0.516 |
| price | 0.470 | 0.329 | 0.000 | 0.328 | 0.626 | 0.551 | 0.049 | 1.000 | 0.515 | 0.276 | 0.632 | 0.578 | 0.060 | 0.329 | 0.461 | 0.058 | 0.100 | 0.012 |
| sqft_above | 0.691 | 0.527 | 0.065 | 0.618 | 0.695 | 0.010 | 0.388 | 0.515 | 1.000 | -0.161 | 0.834 | 0.717 | 0.290 | 0.118 | 0.241 | 0.455 | 0.029 | -0.284 |
| sqft_basement | 0.177 | 0.185 | 0.078 | -0.307 | 0.133 | 0.160 | -0.205 | 0.276 | -0.161 | 1.000 | 0.350 | 0.176 | 0.050 | 0.218 | 0.182 | -0.177 | 0.075 | 0.060 |
| sqft_living | 0.732 | 0.612 | 0.008 | 0.403 | 0.721 | 0.101 | 0.273 | 0.632 | 0.834 | 0.350 | 1.000 | 0.789 | 0.314 | 0.186 | 0.167 | 0.335 | 0.035 | -0.229 |
| sqft_living15 | 0.612 | 0.433 | 0.049 | 0.323 | 0.682 | 0.083 | 0.375 | 0.578 | 0.717 | 0.176 | 0.789 | 1.000 | 0.374 | 0.198 | 0.218 | 0.356 | -0.016 | -0.295 |
| sqft_lot | 0.115 | 0.202 | 0.079 | -0.168 | 0.171 | -0.134 | 0.393 | 0.060 | 0.290 | 0.050 | 0.314 | 0.374 | 1.000 | 0.025 | 0.045 | 0.072 | -0.002 | -0.346 |
| view | 0.135 | 0.154 | 0.000 | 0.054 | 0.147 | 0.056 | 0.069 | 0.329 | 0.118 | 0.218 | 0.186 | 0.198 | 0.025 | 1.000 | 0.587 | 0.000 | 0.085 | 0.000 |
| waterfront | 0.183 | 0.099 | 0.000 | 0.000 | 0.180 | 0.078 | 0.466 | 0.461 | 0.241 | 0.182 | 0.167 | 0.218 | 0.045 | 0.587 | 1.000 | 0.000 | 0.000 | 0.176 |
| yr_built | 0.570 | 0.151 | 0.242 | 0.464 | 0.484 | -0.142 | 0.485 | 0.058 | 0.455 | -0.177 | 0.335 | 0.356 | 0.072 | 0.000 | 0.000 | 1.000 | -0.243 | -0.353 |
| yr_renovated | 0.026 | 0.014 | 0.067 | 0.046 | -0.004 | 0.057 | -0.095 | 0.100 | 0.029 | 0.075 | 0.035 | -0.016 | -0.002 | 0.085 | 0.000 | -0.243 | 1.000 | 0.089 |
| zipcode | -0.236 | -0.174 | 0.096 | -0.027 | -0.185 | 0.259 | -0.516 | 0.012 | -0.284 | 0.060 | -0.229 | -0.295 | -0.346 | 0.000 | 0.176 | -0.353 | 0.089 | 1.000 |
Missing values
Sample
| bedrooms | bathrooms | sqft_living | sqft_lot | floors | waterfront | view | condition | grade | sqft_above | sqft_basement | yr_built | yr_renovated | zipcode | lat | long | sqft_living15 | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3.0 | 1.00 | 1180.0 | 5650.0 | 1.0 | 0.0 | 0.0 | 3.0 | 7.0 | 1180.0 | 0.0 | 1955.0 | 0.0 | 98178.0 | 47.5112 | -122.257 | 1340.0 | 22.190 |
| 1 | 3.0 | 2.25 | 2570.0 | 7242.0 | 2.0 | 0.0 | 0.0 | 3.0 | 7.0 | 2170.0 | 400.0 | 1951.0 | 1991.0 | 98125.0 | 47.7210 | -122.319 | 1690.0 | 53.800 |
| 2 | 2.0 | 1.00 | 770.0 | 10000.0 | 1.0 | 0.0 | 0.0 | 3.0 | 6.0 | 770.0 | 0.0 | 1933.0 | 0.0 | 98028.0 | 47.7379 | -122.233 | 2720.0 | 18.000 |
| 3 | 4.0 | 3.00 | 1960.0 | 5000.0 | 1.0 | 0.0 | 0.0 | 5.0 | 7.0 | 1050.0 | 910.0 | 1965.0 | 0.0 | 98136.0 | 47.5208 | -122.393 | 1360.0 | 60.400 |
| 4 | 3.0 | 2.00 | 1680.0 | 8080.0 | 1.0 | 0.0 | 0.0 | 3.0 | 8.0 | 1680.0 | 0.0 | 1987.0 | 0.0 | 98074.0 | 47.6168 | -122.045 | 1800.0 | 51.000 |
| 5 | 4.0 | 4.50 | 5420.0 | 101930.0 | 1.0 | 0.0 | 0.0 | 3.0 | 11.0 | 3890.0 | 1530.0 | 2001.0 | 0.0 | 98053.0 | 47.6561 | -122.005 | 4760.0 | 123.000 |
| 6 | 3.0 | 2.25 | 1715.0 | 6819.0 | 2.0 | 0.0 | 0.0 | 3.0 | 7.0 | 1715.0 | 0.0 | 1995.0 | 0.0 | 98003.0 | 47.3097 | -122.327 | 2238.0 | 25.750 |
| 7 | 3.0 | 1.50 | 1060.0 | 9711.0 | 1.0 | 0.0 | 0.0 | 3.0 | 7.0 | 1060.0 | 0.0 | 1963.0 | 0.0 | 98198.0 | 47.4095 | -122.315 | 1650.0 | 29.185 |
| 8 | 3.0 | 1.00 | 1780.0 | 7470.0 | 1.0 | 0.0 | 0.0 | 3.0 | 7.0 | 1050.0 | 730.0 | 1960.0 | 0.0 | 98146.0 | 47.5123 | -122.337 | 1780.0 | 22.950 |
| 9 | 3.0 | 2.50 | 1890.0 | 6560.0 | 2.0 | 0.0 | 0.0 | 3.0 | 7.0 | 1890.0 | 0.0 | 2003.0 | 0.0 | 98038.0 | 47.3684 | -122.031 | 2390.0 | 32.300 |
| bedrooms | bathrooms | sqft_living | sqft_lot | floors | waterfront | view | condition | grade | sqft_above | sqft_basement | yr_built | yr_renovated | zipcode | lat | long | sqft_living15 | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 989 | 3.0 | 2.25 | 1670.0 | 5004.0 | 2.0 | 0.0 | 0.0 | 3.0 | 8.0 | 1670.0 | 0.0 | 1987.0 | 0.0 | 98029.0 | 47.5688 | -122.017 | 1850.0 | 48.495 |
| 990 | 4.0 | 1.75 | 2060.0 | 9828.0 | 1.0 | 0.0 | 0.0 | 4.0 | 8.0 | 2060.0 | 0.0 | 1960.0 | 0.0 | 98005.0 | 47.5867 | -122.174 | 2260.0 | 64.000 |
| 991 | 4.0 | 2.50 | 2160.0 | 8158.0 | 1.0 | 0.0 | 0.0 | 4.0 | 8.0 | 1660.0 | 500.0 | 1952.0 | 0.0 | 98115.0 | 47.6948 | -122.328 | 1520.0 | 58.500 |
| 992 | 4.0 | 2.00 | 2780.0 | 11583.0 | 1.0 | 0.0 | 3.0 | 3.0 | 8.0 | 1190.0 | 1590.0 | 1955.0 | 0.0 | 98125.0 | 47.7293 | -122.284 | 2580.0 | 64.500 |
| 993 | 3.0 | 2.00 | 1490.0 | 7651.0 | 1.0 | 0.0 | 0.0 | 3.0 | 7.0 | 1490.0 | 0.0 | 1988.0 | 0.0 | 98003.0 | 47.3211 | -122.325 | 1590.0 | 25.300 |
| 994 | 2.0 | 1.00 | 740.0 | 6460.0 | 1.0 | 0.0 | 0.0 | 3.0 | 6.0 | 740.0 | 0.0 | 1953.0 | 0.0 | 98146.0 | 47.5077 | -122.344 | 1170.0 | 17.850 |
| 995 | 4.0 | 2.50 | 1860.0 | 6325.0 | 2.0 | 0.0 | 0.0 | 4.0 | 7.0 | 1860.0 | 0.0 | 1991.0 | 0.0 | 98038.0 | 47.3492 | -122.030 | 1860.0 | 29.100 |
| 996 | 2.0 | 2.75 | 1590.0 | 20917.0 | 1.5 | 0.0 | 0.0 | 3.0 | 5.0 | 1590.0 | 0.0 | 1920.0 | 0.0 | 98001.0 | 47.2786 | -122.250 | 1310.0 | 19.995 |
| 997 | 2.0 | 1.00 | 850.0 | 2340.0 | 1.0 | 0.0 | 0.0 | 3.0 | 7.0 | 850.0 | 0.0 | 1922.0 | 0.0 | 98105.0 | 47.6707 | -122.328 | 1300.0 | 55.350 |
| 998 | 2.0 | 1.00 | 1030.0 | 4188.0 | 1.0 | 0.0 | 0.0 | 3.0 | 8.0 | 1030.0 | 0.0 | 1981.0 | 0.0 | 98038.0 | 47.3738 | -122.057 | 1450.0 | 18.995 |